Discover Relevant Environment Feature Using Concurrent Reinforcement Learning
نویسندگان
چکیده
In order to compare the policies more efficiently, we introduce a new reinforcement learning method called concurrent biased learning. This is a multi-thread learning method, in which each learning thread refers to one feature of the environment. If an agent intentionally focuses on part of these environmental features to learn a policy of a task, we call this method a biased learning; otherwise, if an agent uses all features that it perceives to learn a task, we call this unbiased learning. We present a method concerning the relevance of information in order to improve the learning of a reinforcement learning robot. We introduce a new concurrent online learning algorithm to calculate the contribution C(s) and relevance degree I(s) to quantify the relevancy of features with respect to a desired learning task. Our analysis shows that the correlation relationship of the environment features can be extracted and projected to concurrent learning threads. By comparing the contribution of these learning threads, we can evaluate the relevance degree of a feature when performing a particular learning task. ( ) i s π If the agent learns a policy with respect to one of the environmental features , we call this biased learning with respect to feature i . The biased Q-value can be denoted as . Then the Bellman update function of the biased Q-value is: i
منابع مشابه
Running head : AGING , REINFORCEMENT LEARNING AND ATTENTION 1 The effects of aging on the interaction between reinforcement learning and attention
Reinforcement learning (RL) in complex environments relies on selective attention to uncover those aspects of the environment that are most predictive of reward. While previous work has focused on age-related changes in RL, it is not known whether older adults learn differently from younger adults when selective attention is required. In two experiments, we examined how aging impacts on the int...
متن کاملThe effects of aging on the interaction between reinforcement learning and attention.
Reinforcement learning (RL) in complex environments relies on selective attention to uncover those aspects of the environment that are most predictive of reward. Whereas previous work has focused on age-related changes in RL, it is not known whether older adults learn differently from younger adults when selective attention is required. In 2 experiments, we examined how aging affects the intera...
متن کاملLearning Visual Feature Spaces for Robotic Manipulation with Deep Spatial Autoencoders
Reinforcement learning provides a powerful and flexible framework for automated acquisition of robotic motion skills. However, applying reinforcement learning requires a sufficiently detailed representation of the state, including the configuration of task-relevant objects. We present an approach that automates state-space construction by learning a state representation directly from camera ima...
متن کاملA multiagent architecture for concurrent reinforcement learning
In this paper we propose a multiagent architecture for implementing concurrent reinforcement learning, an approach where several agents, sharing the same environment, perceptions and actions, work towards one only objective: learning a single value function. We present encouraging experimental results derived from the initial phase of our research on the combination of concurrent reinforcement ...
متن کاملRRLUFF: Ranking function based on Reinforcement Learning using User Feedback and Web Document Features
Principal aim of a search engine is to provide the sorted results according to user’s requirements. To achieve this aim, it employs ranking methods to rank the web documents based on their significance and relevance to user query. The novelty of this paper is to provide user feedback-based ranking algorithm using reinforcement learning. The proposed algorithm is called RRLUFF, in which the rank...
متن کامل